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DETAILED ACTION 
Claim Objections 

1 . Claims 8, 1 5, 1 8, 1 9, 25, and 35 are objected to because of the following 
informalities: each of the above claims depend on themselves. Appropriate correction 
is required. For purposes of examination, the Examiner has assumed the following 
dependencies: Claim 8 depends on Claim 7, Claim 15 depends on Claim 14, Claim 18 
depends on Claim 17, Claim 19 depends on Claim 18, Claim 25 depends on Claim 23, 
Claim 35 depends on Claim 33. 

2. Claims 24, 31 , and 34 are objected to because of the following informalities: 
each of the above claims depend on claims after them. Appropriate correction is 
required. For purposes of examination, the Examiner has assumed the following 
dependencies: Claim 24 depends on Claim 23, Claim 31 depends on Claim 30, and 
Claim 34 depends on Claim 33. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

Claims 1-40 are rejected under 35 U.S.C. 103(a) as being unpatentable over 

Noguchi (U.S. Patent No. 5,983,184). 
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In regard to independent Claim 1 (and similarly independent Claim 17), Noguchi 
teaches that in Fig. 4 is shown a sample of an HTML file used in the preferred 
embodiment of the present invention. In this embodiment, the HTML file is input to 
produce a sentence including an embedded command. As is shown in Fig. 4, the HTML 
file in a text form includes tags, <TITLE>, <H1>, <H2>, <H3>, <H4>, <H5> and <H6>. 
The HTML file in Fig. 4 is processed by the display section (133) and is shown to a user 
as a graphical image, as is shown in Fig. 5. The HTML tag begins with the start tag 
<XXXX> and ends with the end tag </XXXX>. Thus, the system can recognize the types 
of respective tags and can extract (parse) them (Col. 1 1 , lines 10-15; Col. 1 1 , lines 20- 
25). Noguchi also teaches producing an embedded command file. The embedded 
command file is produced by the HTML analyzer (125) using two procedures. The 
HTML file shown in Fig. 4 is temporarily converted into a form shown in Table 1 (note in 
Table 1 how the tags are converted to voice attributes, which include speed, pitch, 
volume, and intonation). Unnecessary information, such as "<html>" or "<|MG 
SRC="yamato.gif" >," is removed from the HTML file, and the invalid start tag is 
converted into voice attribute information based on the text attribute/voice attribute 
conversion table (conversion table 143). The end tag is converted into an embedded 
command to return the voice attribute, which has been changed by the related start tag, 
to a default value (Col. 11, lines 63-67; Col. 12, lines 1-4). Noguchi also teaches an 
example of a tentative file (see Table 2). Compare to Claim 1 (and similarly Claim 17), 
"... parsing the document to identify a presence of a selected tag indicating an 
emphasis level, wherein text is associated with the selected tag"). Noguchi also 
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teaches that the voice synthesizer (165) generates a voice signal in accordance with the 
voice parameter produced by the parameter generator (163) (Col. 14, lines 43-45). 
Since the voice synthesization is performed in response to the type of a special word 
that is included in the text, a user can identify the type of the special word nearly by 
listening and without using vision, so that he or she can understand the contents of the 
text (Col. 14, lines 51-56; compare to Claim 1 (and similarly Claim 17), "... responsive 
to an identification of the presence of the selected tag, audibly presenting the text 
using the emphasis level"). Noguchi does not explicitly teach that the selected tag is 
read . . . prior to presenting other text within the document. However, Noguchi does 
teach that in response to using certain predefined function keys, one can maneuver 
about the page in a number of ways, some of which would effectively read the selected 
tags before reading any other text in the web page (see Cols. 17-19). Therefore, one of 
ordinary skill in the art at the time of invention would have found it obvious to have been 
able to quickly locate portions of the text, such as hyperlinks before hearing any of the 
remaining text providing the benefit of audibly locating links to other web sites. 

In regard to dependent Claim 2 (and similarly dependent Claims 14, 18, and 30), 
Noguchi teaches that the communication control (110) controls communications with the 
Web server (60), as is shown in FIG. 3. In FIG. 3, first, information specifying a URL 
input at the user input section (127) is received via the input/output controller (121). 
Based on this information, the Web server (60) is accessed, and an HTML (Hyper Text 
Markup Language) file is received from the Web server (60) (Col. 10, lines 56-62; 
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compare with Claim 2 (and similarly Claims 14, 18, and 30), "... the document is a 
markup language document"). 

In regard to dependent Claim 3 (and similarly dependent Claims 15, 19, and 31), 
Noguchi teaches that the communication control (110) controls communications with the 
Web server (60), as is shown in Fig. 3. In Fig. 3, first, information specifying a URL input 
at the user input section (127) is received via the input/output controller (121). Based on 
this information, the Web server (60) is accessed, and an HTML (Hyper Text Markup 
Language) file is received from the Web server (60) (Col. 10, lines 56-62; compare with 
Claim 3 (and similarly Claims 15, 19, and 31), "... the markup language document is 
one of a hypertext markup language document and a extensible markup language 
document"). 

In regard to dependent Claim 4 (and similarly dependent Claims 16, 20, and 32), 
Noguchi teaches that the communication control (110) controls communications with the 
Web server (60), as is shown in Fig. 3. In Fig. 3, first, information specifying a URL input 
at the user input section (127) is received via the input/output controller (121 ). Based on 
this information, the Web server (60) is accessed, and an HTML (Hyper Text Markup 
Language) file is received from the Web server (60) (Col. 10, lines 56-62; compare with 
Claim 4 (and similarly Claims 16, 20, and 32), "... the document is a web page"). 

In regard to dependent Claim 5 (and similarly dependent Claim 21 ), Noguchi 
teaches in Table 1, an example of the text attribute/voice attribute conversion table (Col. 
12, lines 9-33; compare to Claim 5 (and similarly Claim 21), "... the selected level of 
emphasis is selected based on a type for the selected tag"). 
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In regard to dependent Claim 6 (and similarly dependent Claim 13), Noguchi 
teaches that the Web browser (120) includes a data input/output controller (121), an 
HTML file storage section (123), an HTML analyzer (125), a user input section (127), a 
focus controller (129), a word list (131), a display section (133), a conversion table 
(135), and a focus pointer (139)(Col. 9, lines 34-38; compare to Claim 6 (and similarly 
Claim 1 3), "... the method is located in a web browser"). 

In regard to independent Claim 7 (and similarly independent Claim 23), Noguchi 
teaches that the communication control (110) controls communications with the Web 
server (60), as is shown in Fig. 3. In Fig. 3, first, information specifying a URL input at 
the user input section (127) is received via the input/output controller (121). Based on 
this information, the Web server (60) is accessed, and an HTML (Hyper Text Markup 
Language) file is received from the Web server (60) (Col. 10, lines 56-62; compare with 
Claim 7 (and similarly Claim 23), "... receiving the document"). Noguchi also teaches 
that in Fig. 4 is shown a sample of an HTML file used in the preferred embodiment of 
the present invention. In this embodiment, the HTML file is input to produce a sentence 
including an embedded command. As is shown in Fig. 4, the HTML file in a text form 
includes tags, <TITLE>, <H1>, <H2>, <H3>, <H4>, <H5> and <H6>. The HTML file in 
Fig. 4 is processed by the display section (133) and is shown to a user as a graphical 
image, as is shown in Fig. 5. The HTML tag begins with the start tag <XXXX> and ends 
with the end tag </XXXX>. Thus, the system can recognize the types of respective tags 
and can extract (parse) them (Col. 1 1 , lines 10-15; Col. 1 1 , lines 20-25; compare with 
Claim 7 (and similarly Claim 23), "... parsing the document to identifying a present 
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of a selected tag, wherein a set of text is associated with the selected tag"). 
Noguchi also teaches producing an embedded command file (data structure). The 
embedded command file is produced by the HTML analyzer (125) using two 
procedures. The HTML file shown in Fig. 4 is temporarily converted into a form shown 
in Table 1 (note in Table 1 how the tags are converted to voice attributes, which include 
speed, pitch, volume, and intonation). Unnecessary information, such as "<html>" or 
"<IMG SRC^'yamato.gif'V is removed from the HTML file, and the invalid start tag is 
converted into voice attribute information based on the text attribute/voice attribute 
conversion table (conversion table 143). The end tag is converted into an embedded 
command to return the voice attribute, which has been changed by the related start tag, 
to a default value (Col. 11, lines 63-67; Col. 12, lines 1-4). Noguchi also teaches an 
example of a tentative file (see Table 2). Tables 1 and 2 represent lists of commands. 
Compare to Claim 7 (and similarly Claim 23), "... responsive each selected tag 
identified to form an identified tag, placing the set of text in a data structure; 
responsive to placing the set of text in the data structure, associating an 
emphasis level with the set of text; responsive to a completion of parsing the 
document, presenting each set of text in the data structure using an associated 
emphasis level"). 

In regard to dependent Claim 8 (and similarly dependent Claim 24), Noguchi 
teaches (a) producing a word list managing information for specifying the plurality of 
special words and position information for specifying a position of the plurality of special 
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words in the sentence (Col. 5, lines 3-7; compare to Claim 8 (and similarly Claim 24), 
"... the data structure is one of a list, a linked list, and a database"). 

In regard to dependent Claim 9 (and similarly dependent Claim 25), Noguchi 
teaches that in Fig. 4 is shown a sample of an HTML file used in the preferred 
embodiment of the present invention. In this embodiment, the HTML file is input to 
produce a sentence including an embedded command. As is shown in Fig. 4, the HTML 
file in a text form includes tags, <TITLE>, <H1> <H2>, <H3>, <H4>, <H5> and <H6> 
(Col. 11, lines 10-15; compare to Claim 9 (and similarly Claim 25), "... the set of text is 
at least one word"). 

In regard to dependent Claim 10 (and similarly dependent Claim 26), Noguchi 
teaches Table 1 (Col. 12, lines 9-33; compare to Claim 10 (and similarly Claim 26), "... 
the emphasis level is at least one of a volume level and a type of intonation"). 

In regard to dependent Claim 1 1 (and similarly dependent Claim 27), Noguchi 
does not explicitly teach that the text in the data structure is presented prior to 
presenting other text in the document However, Noguchi does teach that in response 
to using certain predefined function keys, one can maneuver about the page in a 
number of ways, some of which would effectively read the selected tags before reading 
any other text in the web page (see Cols. 17-19). Therefore, one of ordinary skill in the 
art at the time of invention would have found it obvious to have been able to quickly 
locate portions of the text, such as hyperlinks before hearing any of the remaining text 
providing the benefit of audibly locating links to other web sites. 
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In regard to dependent Claim 12 (and similarly dependent Claim 28), Noguchi 
teaches that the present invention enables a visually impaired user to freely and easily 
control hypertext. A voice synthesis program orally reads hypertext on the Internet. In 
synchronization with this reading, the system focuses on a link keyword that is most 
closely related to the location where reading is currently being performed. When an 
instruction "jump to link destination" is input (by voice or with a key), the program control 
can jump to the link destination for the link keyword that is being focused on. Further, 
the reading of only a link keyword can be instructed (see Abstract; compare to Claim 12 
(and similarly Claim 28), "... the text is presented audibly"). 

In regard to independent Claim 33, Noguchi teaches a voice synthesis system 
(100) that includes a central processing unit (CPU) (1) and a memory (4). The CPU (1) 
and the memory (4) communicate with a hard disk drive (13) as an auxiliary storage 
device via a bus (2). A floppy disk drive (or a driver for an MO or a CD-ROM) (20) 
communicates with a bus (2) via a floppy disk controller (19). A floppy disk (or a 
medium, such as an MO or a CD-ROM) is inserted into the floppy disk drive (or a driver 
for an MO or a CD-ROM) (20). On the floppy disk and the hard disk drive (13) and in a 
ROM (14) are stored code for a computer program that sends commands to the CPU 
(1), etc., while interacting with an operating system to carry out the preset invention. 
This code is executed by being loaded into the memory (4). The code for the computer 
program may be compressed, or may be divided into a plurality of code segments and 
stored in a plurality of storage media. The voice synthesis system (100) can be a used 
as a system that includes user interface hardware. The user interface hardware 
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components are, for example, a pointing device (mouse, joystick, etc.) (7) and a 
keyboard (6) used for input, and a display (12) used to provide visual data to a user. A 
printer and a modem can be connected, respectively, via a parallel port (16) and via a 
serial port (15). The voice synthesis system (100) can communicate with another 
computer via the serial port (15) and the modem, or via a communication adaptor (18) 
(Col. 8, lines 20-44, Fig. 1 ; compare to Claim 33, "a bus system; a communications 
unit connected to the bus system; a memory connected to the bus system, 
wherein the memory includes a set of instructions; a processing unit connected 
to the bus system ..."). Noguchi also teaches that in Fig. 4 is shown a sample of an 
HTML file used in the preferred embodiment of the present invention. In this 
embodiment, the HTML file is input to produce a sentence including an embedded 
command. As is shown in Fig. 4, the HTML file in a text form includes tags, <TITLE>, 
<H1>, <H2>, <H3>, <H4>, <H5> and <H6>. The HTML file in Fig. 4 is processed by the 
display section (133) and is shown to a user as a graphical image, as is shown in Fig. 5. 
The HTML tag begins with the start tag <XXXX> and ends with the end tag </XXXX>. 
Thus, the system can recognize the types of respective tags and can extract (parse) 
them (Col. 11, lines 10-15; Col. 11, lines 20-25). Noguchi also teaches producing an 
embedded command file. The embedded command file is produced by the HTML 
analyzer (125) using two procedures. The HTML file shown in Fig. 4 is temporarily 
converted into a form shown in Table 1 (note in Table 1 how the tags are converted to 
voice attributes, which include speed, pitch, volume, and intonation). Unnecessary 
information, such as "<html>" or "<|MG SRC="yamato.gif">," is removed from the HTML 
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file, and the invalid start tag is converted into voice attribute information based on the 
text attribute/voice attribute conversion table (conversion table 143). The end tag is 
converted into an embedded command to return the voice attribute, which has been 
changed by the related start tag, to a default value (Col. 11, lines 63-67; Col. 12, lines 1- 
4). Noguchi also teaches an example of a tentative file (see Table 2). Compare to 
Claim 33, "... wherein the processing unit executes the set of instructions to 
parse the document to identifying a presence of a selected tag indicating an 
emphasis level, wherein text is associated with the selected tag; and, audibly 
present the text using the emphasis level"). Noguchi does not explicitly teach that ... 
prior to presenting other text within the document in response to an identification of the 
presence of the selected tag. However, Noguchi does teach that in response to using 
certain predefined function keys, one can maneuver about the page in a number of 
ways, some of which would effectively read the selected tags before reading any other 
text in the web page (see Cols. 17-19). Therefore, one of ordinary skill in the art at the 
time of invention would have found it obvious to have been able to quickly locate 
portions of the text, such as hyperlinks before hearing any of the remaining text 
providing the benefit of audibly locating links to other web sites. 

In regard to dependent Claim 34, Noguchi teaches in Fig. 1 a single bus (2). 
Compare to Claim 34, "... the bus system is a single bus". 

In regard to dependent Claim 35, Noguchi teaches that the present invention can 
be implemented by a normal personal computer (PC), a workstation, or a combination 
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of them (hence more than one bus) (Col. 8, lines 53-55; compare to Claim 35, "... the 
bus system includes a primary bus and a secondary bus"). 

In regard to dependent Claim 36, Noguchi teaches that the present invention can 
be implemented by a normal personal computer (PC), a workstation, or a combination 
of them (hence more than one processor) (Col. 8, lines 53-55; compare to Claim 36, "... 
the processing unit includes a plurality of processors"). 

In regard to dependent Claim 37, Noguchi teaches that the present invention 
may be implemented as a client/server system wherein a client machine is connected 
by a LAN to a server machine via Ethernet or a token ring (Col. 9, lines 9-12; compare 
to Claim 37, "... the communications unit is one of a modem and Ethernet 
adapter"). 

In regard to independent Claim 38, Noguchi teaches a voice synthesis system 
100 that includes a central processing unit (CPU) (1) and a memory (4). The CPU (1 ) 
and the memory (4) communicate with a hard disk drive (13) as an auxiliary storage 
device via a bus (2). A floppy disk drive (or a driver for an MO or a CD-ROM) (20) 
communicates with a bus (2) via a floppy disk controller (19). A floppy disk (or a 
medium, such as an MO or a CD-ROM) is inserted into the floppy disk drive (or a driver 
for an MO or a CD-ROM) (20). On the floppy disk and the hard disk drive (13) and in a 
ROM (14) are stored code for a computer program that sends commands to the CPU 
(1), etc., while interacting with an operating system to carry out the preset invention. 
This code is executed by being loaded into the memory (4). The code for the computer 
program may be compressed, or may be divided into a plurality of code segments and 
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stored in a plurality of storage media. The voice synthesis system (100) can be a used 
as a system that includes user interface hardware. The user interface hardware 
components are, for example, a pointing device (mouse, joystick, etc.) (7) and a 
keyboard (6) used for input, and a display (12) used to provide visual data to a user. A 
printer and a modem can be connected, respectively, via a parallel port (16) and via a 
serial port (15). The voice synthesis system (100) can communicate with another 
computer via the serial port (15) and the modem, or via a communication adaptor (18) 
(Col. 8, lines 20-44, Fig. 1 ; compare to Claim 38, "a bus system; a communications 
unit connected to the bus system; a memory connected to the bus system, 
wherein the memory includes a set of instructions; a processing unit connected 
to the bus system, wherein the processing unit executes the set of instructions"). 
Noguchi also teaches that the communication control (110) controls communications 
with the Web server (60), as is shown in Fig. 3. In Fig. 3, first, information specifying a 
URL input at the user input section (127) is received via the input/output controller 
(121). Based on this information, the Web server (60) is accessed, and an HTML (Hyper 
Text Markup Language) file is received from the Web server (60) (Col. 10, lines 56-62; 
compare with Claim 38, "... receive a document"). Noguchi also teaches that in Fig. 4 
is shown a sample of an HTML file used in the preferred embodiment of the present 
invention. In this embodiment, the HTML file is input to produce a sentence including an 
embedded command. As is shown in Fig. 4, the HTML file in a text form includes tags, 
<TITLE>, <H1 >, <H2>, <H3>, <H4>, <H5> and <H6>. The HTML file in Fig. 4 is 
processed by the display section (133) and is shown to a user as a graphical image, as 
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is shown in Fig. 5. The HTML tag begins with the start tag <XXXX> and ends with the 
end tag </XXXX>. Thus, the system can recognize the types of respective tags and can 
extract (parse) them (Col. 1 1 , lines 10-15; Col. 1 1 , lines 20-25; compare with Claim 38, 
"... parse the document to identifying a present of a selected tag, wherein a set of 
text is associated with the selected tag' 7 ). Noguchi also teaches producing an 
embedded command file. The embedded command file is produced by the HTML 
analyzer (125) using two procedures. The HTML file shown in Fig. 4 is temporarily 
converted into a form shown in Table 1 (note in Table 1 how the tags are converted to 
voice attributes, which include speed, pitch, volume, and intonation). Unnecessary 
information, such as "<html>" or "<|MG SRC="yamato.gif>," is removed from the HTML 
file, and the invalid start tag is converted into voice attribute information based on the 
text attribute/voice attribute conversion table (conversion table 143). The end tag is 
converted into an embedded command to return the voice attribute, which has been 
changed by the related start tag, to a default value (Col. 11, lines 63-67; Col. 12, lines 1- 
4). Noguchi also teaches an example of a tentative file (see Table 2). Compare to 
Claim 38, "... place the set of text in a data structure in response each selected 
tag identified to form an identified tag; and, associate an emphasis level with the 
set of text in response to placing the set of text in data structure using an 
associated emphasis level in response to a completion of parsing the 
document"). 

In regard to independent Claim 39, Noguchi teaches that in Fig. 4 is shown a 
sample of an HTML file used in the preferred embodiment of the present invention. In 



Application/Control Number: 09/784,590 Page 15 

Art Unit: 2176 

this embodiment, the HTML file is input to produce a sentence including an embedded 
command. As is shown in Fig. 4, the HTML file in a text form includes tags, <TITLE> 
<H1> <H2>, <H3>, <H4> <H5> and <H6>. The HTML file in Fig. 4 is processed by the 
display section (133) and is shown to a user as a graphical image, as is shown in Fig. 5. 
The HTML tag begins with the start tag <XXXX> and ends with the end tag </XXXX>. 
Thus, the system can recognize the types of respective tags and can extract (parse) 
them (Col. 1 1, lines 10-15; Col. 11, lines 20-25; compare with Claim 39, "...first 
instructions for parsing the document to identifying a presence of a selected tag 
indicating an emphasis level, wherein text is associated with the selected tag"). 
Noguchi also teaches that the voice synthesizer (165) generates a voice signal in 
accordance with the voice parameter produced by the parameter generator (163) (Col. 
14, lines 43-45). Since the voice synthesization is performed in response to the type of 
a special word that is included in the text, a user can identify the type of the special 
word nearly by listening and without using vision, so that he or she can understand the 
contents of the text (Col. 14, lines 51-56; compare to Claim 39, "... second 
instructions, responsive to an identification of the presence of the selected tag, 
for audibly presenting the text using the emphasis level"). Noguchi does not 
explicitly teach that . . . prior to presenting other text within the document However, 
Noguchi does teach that in response to using certain predefined function keys, one can 
maneuver about the page in a number of ways, some of which would effectively read 
the selected tags before reading any other text in the web page (see Cols. 17-19). 
Therefore, one of ordinary skill in the art at the time of invention would have found it 
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obvious to have been able to quickly locate portions of the text, such as hyperlinks 
before hearing any of the remaining text providing the benefit of audibly locating links to 
other web sites. 

In regard to independent Claim 40, Noguchi teaches that the communication 
control (110) controls communications with the Web server (60), as is shown in Fig. 3. 
In Fig. 3, first, information specifying a URL input at the user input section (127) is 
received via the input/output controller (121). Based on this information, the Web server 
(60) is accessed, and an HTML (Hyper Text Markup Language) file is received from the 
Web server (60) (Col. 10, lines 56-62; compare with Claim 40, "... first instructions for 
receiving a document' 9 ). Noguchi also teaches that in Fig. 4 is shown a sample of an 
HTML file used in the preferred embodiment of the present invention. In this 
embodiment, the HTML file is input to produce a sentence including an embedded 
command. As is shown in Fig. 4, the HTML file in a text form includes tags, <TITLE>, 
<H1>, <H2>, <H3> <H4> <H5> and <H6>. The HTML file in Fig. 4 is processed by the 
display section (133) and is shown to a user as a graphical image, as is shown in Fig. 5. 
The HTML tag begins with the start tag <XXXX> and ends with the end tag </XXXX>. 
Thus, the system can recognize the types of respective tags and can extract (parse) 
them (Col. 11, lines 10-15; Col. 11, lines 20-25; compare with Claim 40, "... second 
instructions for parsing the document to identifying a present of a selected tag, 
wherein a set of text is associated with the selected tag"). Noguchi also teaches 
producing an embedded command file (data structure). The embedded command file 
is produced by the HTML analyzer (125) using two procedures. The HTML file shown in 
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Fig. 4 is temporarily converted into a form shown in Table 1 (note in Table 1 how the 
tags are converted to voice attributes, which include speed, pitch, volume, and 
intonation). Unnecessary information, such as "<html>" or "<IMG SRC="yamato.gif">," 
is removed from the HTML file, and the invalid start tag is converted into voice attribute 
information based on the text attribute/voice attribute conversion table (conversion table 
143). The end tag is converted into an embedded command to return the voice attribute, 
which has been changed by the related start tag, to a default value (Col. 1 1 , lines 63-67; 
Col. 12, lines 1-4). Noguchi also teaches an example of a tentative file (see Table 2). 
Tables 1 and 2 represent lists of commands. Compare to Claim 40,"... third 
instructions, responsive each selected tag identified to form an identified tag, for 
placing the set of text in a data structure; fourth instructions, responsive to 
placing the set of text in the data structure, for associating an emphasis level with 
the set of text"). Noguchi also teaches that the voice synthesizer (1 65) generates a 
voice signal in accordance with the voice parameter produced by the parameter 
generator (163) (Col. 14, lines 43-45). Since the voice synthesization is performed in 
response to the type of a special word that is included in the text, a user can identify the 
type of the special word nearly by listening and without using vision, so that he or she 
can understand the contents of the text (Col. 14, lines 51-56; compare to Claim 40, "... 
and fifth instructions, responsive to a completion of parsing the document, for 
presenting each set of text in data structure using an associated emphasis 
level"). 
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Conclusion 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James H Blackwell whose telephone number is 703- 
305-0940. The examiner can normally be reached on Mon-Fri. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Joseph H Feild can be reached on 703-305-9792. The fax phone number 
for the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



James H. Blackwell 
03/25/04 




